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ABSTRACT 



A method and apparatus for online reorganization of a 
database which does not prevent a process from accessing 
the database during the data reorganization. The invention 
uses a non-blocking drain to lock on a database, unloads a 
copy of the data contained in the database, reorganizes the 
copied data,"4oads th'e reorganized data into a shadow 
location, applies log records to the shadow location data so 
that the reorganized data may be adjusted for any changes to 
the database after the data was copied and reorganized, and 
then replaces the data in the database with the reorganized 
data. The non-blocking drain does not prevent other requests 
on the database from being processed while the reorganiza- 
tion lock is in place, except briefly when the reorganized 
data replaces the data in the database. 

20 Claims, 4 Drawing Sheets 
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NON-BLOCKING DRAIN METHOD AND This scattering results in reduced performance as now the 

APPARATUS USED TO REORGANIZE DATA storage device must move between widely scattered physical 

IN A DATABASE locations if logical order operations are to be performed. 

This is true of whatever type of direct access storage device 

RELATED APPLICATIGNS 5 (DASD) is used to store the index or data file. Therefore, the 

^ . , „ ^ . files, including the index file, need to be reorganized peri- 

o ^K? ' '° \ rf 'm^' '?nn°.° "dicaUy so that the logical and physical ordering between 

Sen No. t«/366 564 (parent apphcat.on) filed Dec. 30, 1994. ^^.^ ^ P, correspond. However, 

now abandoned and entitled Interaction Between Applica- ^^^^ ^ ^^ reorganize the files require access to 

uon of a Ug and Maintenance of a Table that Maps Record g,^^ ,^ ^^^j^j^j^^ ^^^^ ^^^^ ,^^ 3^. 

dentifiers During Online Reorganization of a Databi^e ; ^^^.^^ ^^ ^ ^^^^^^ ^^^^ 24x7 avaUabUity, 

U.S patent application Ser^ No. 08/457,150 (child j^^, ■ ,wenty-four hours-a-day, seven days-a-week 

application) now U.S. Pal. No. 5.721 915, a continuation of .ccessibiUty, long durations of data unavailability ate unac- 

the parent application, filed Jun. 1, 1995 and entitled "Inter- ceotable 

action Between Application of a Log and Maintenance of a _ ^ ' , ^ , . ..^ ^ lm- • 

Table that Maps Record Identifiers During Online Reorga- example of a database requiring ?4x7 availability is 

nization of a Database"; and U.S. patent appUcation Ser. No. ^ ^^^^^^^^ ^^^^^^ ^^^""g/ ^^}' ^ ^^^^^^^ 

08/885,955, a FHe Wrapper Continuation (37 C.ER. L62) [^^""'"l reorganization is required to minimize storage over- 

AppUcation of the child appUcation and entitled "Interaction ^ead for the ever changing records. However, a bank cannot 

Between Application of a Log and Maintenance of a Table ^^^^^ to close down record access to reorganize its 

that Maps Record Identifiers During OnUne Reorganization '° Custonier service requires access during the day, 

of a Database", filed Jun. 30, 1997. processing other transactions, commonly occurring at 

night, requires nighttime access. 

BACKGROUND OF THE INVENTION Recognizing that a reorganization utility can be one of the 

. largest inhibitors to data access — reorganization utilities 

L Field of the Invention 25 commonly block access to the data by other utihties and 

The present invention relates to the online reorganization applications during the reorganization process— several 

of data contained in a database. More particularly, the solutions have been proposed to make data available more of 

invention concerns a method and apparatus for reorganizing the time. These "onhne" reorganization methods help mini- 

a database while allowing substantially uninterrupted access mj^e data "outages." 

to the database, ^ major drawback to these techniques is that they require 

2. Description of the Related Art a reorganizational process to request a "blocking drain", also 

Databases are used on computers for a myriad of reasons. known as a lock, on a resource, thereby making other 

In many cases the databases are extremely large, having processes wail. For example, if reorganizational process B 

entries in the millions. With large databases, the information requests a lock, it must wail until a process A, which already 

must be available at all times on a transactional or real time has a lock, finishes. If another process C comes along before 

basis, and large mainframe computers are usually employed process A finishes, process C queues up behind process B 

to access the data. International Business Machines and must wait for both A and B to finish. Once A finishes, 

Corporation, (IBM), assignee of the current invention, has B locks the database and process C continues to wait until 

developed the leading database environment referred to as ^ reorganizational process B can record a starting point, llie 

DB2 for use in conjunction with compatible mainframe wait experienced by process C can be substantial if process 

computers. A is long running or does not complete, or if process C must 

One feature common in all database systems and included also wait for additional processes preceding A to finish, 

in the DB2 is the capability to index various information. Accordingly, there is a need for an online databaser. 

The use of the index allows faster access for searches and 45 reorganization technique using a "non-blocking" drain 

requests based upon the indexed information. DB2 uses a which will wait for a resource without blocking other 

balanced tree index structure. In this structure, root, tree and requests on that resource. Referring to the above example, 

leaf pages are used, with each page at each level containing there is a need for a technique where process C can access 

the same number of entries, except the last one. The leaf the database while process B waits for process A to finish so 

pages are the lowest level and each contains a number of that a reorganization starting point, or logical record 

entries referring to the actual data records contained in DB2 sequence number (LRSN). can be established by B, thereby 

data tables. Each leaf page is maintained in internal logical allowing the reorganization process to begin, 

order automatically by DB2. Tree pages are the next level There is also a need for an online database reorganization 

up, and are used to indicate the logical order of the leaf technique using a non -blocking drain which allows access to 

P^ges. 55 a database during the reorganization of the data and mini- 

For large databases, there may be several layers of tree mizes unavailability of the database in completing the 

pages, for example, a higher level of tree pages referencing reorganization process, 
a lower level of tree pages. Ultimately, the number of tree 

pages is reduced such that all the entries or references fit into SUMMARY OF TOE INVENTION 

a single page referred to as the root page. As in leaf pages, 60 Broadly, the present invention concerns an online data- 

within each tree or root page the entries are kept in logical base reorganization process using a non-blocking drain. The 

order automatically by DB2. process does not block other process's requests for access to 

One problem with this type of index organization is the the target database during the data reorganization, even if the 

physical location of the leaf pages often becomes quite reorganization process is waiting for the target database to 

scattered. Another problem is that the rows of an index — an 65 become available. The reorganization technique only briefly 

index being ordered row by row — may become scattered causes the database to be unavailable when completing the 

acToss multiple data pages, rather than clustered together. reorganization process. 
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In one enabodinoenl, the invention may be implemented to units of work in accordance with one embodiment of the 

provide a method to reorganize a database that does not present invention. 

prevent other processes from accessing the database while DETAILED DESCRIPTION OF THE 

the reorganization is in progress. The method uses a non- PREFERRED EMBODIMENTS 

blocking drain to lock on an original database or "resource", 5 

unloads data to be reorganized from the resource, assigns a ^ Hardware Components & Interconnections 

LRSN, reorganizes the copied data, and loads it into a One aspect of the invention concerns a digital data 

shadow location. Log records may be used to adjust the data processing apparatus 100, which may be embodied by 

in the shadow location to account for changes to resource various hardware components and interconnections. RG. 1 

data that occurred after the data was unloaded. Lastly, the lO shows an example of one such digital data processing 
data in the resource is replaced with the reorganized data. ^ apparatus 100. The apparatus in the illustrated embodiment 

In another embodiment, the invention may be imple-^ includes a client computer 102 and a server computer 104 

mented to provide an apparatus comprising a database and including a processing unit 106, such as a microprocessor or 

a digital processing device. The digital processing device o^^r processing machine, communicatively coupled to a 

may be configured in one embodiment toieceive data from 15 storage unit 108. In the present example, the storage unit 108 

the database and then is used to reorganize and restore the includes a fast-access memory and may include nonvolatile 

database as disclosed immediately above. storage. The fast-access memory preferably comprises ran- 

In still another embodiment, the invention may be imple^ ^"^^^ ^^^^^.^^ P^^f 

mented to provide a signal-bearing medium tangibly instructions executed by the processing unit 106 

embodying a program of machine-readable instructions durmg execution of a program. The nonvolatile storage ma>^ 

executable by a di|tal data processing apparatus to^erform ^ompnse for example one or more magnetic datasoragj 

,u ^ c • • \ * u disks such as a "hard drive or any other suital fl^.to tagg^ 

a method for reorganizmg a database. i- . . • T 

^ * ^ ganaariSiLC ther. the chent computer 102 may mclude m one 

Hie present invention affords its users with a number of ernbSS^nt an output module 112 for outputting/displaying 
distinct advantages. One advantage is that the invention ^5 program status results on a graphic display 116, print mecha- 
provides substantially continuous access to the database data storage medium U8. 
while the reorganization process is executing or waiting to ^ ^^^^^^ -^^j^^^ ^ client/server interface 
execute^ Another advantage is that the mvention provides for ^^^^ ^ ^ ^ ^ ^^^^ electromagnetic link or other 
a non-blocking drain which is different than a blocking ^^^^ exchanging data with the server computer 104. In 
dram. THe non-bloclang dram allows the reorgamzation 3, one embodiment, the server computer 104 may be a main- 
process to lock and queue while earlier-processes- frame computer manufactured by the International Business 
processes which requested database access before the reor- j^^^j^-^^^ Corporation of Armonk, N.Y, and may use an 
ganizauon process^lo complete their routine. At the same ^^Id under trademarks such as MVS. Or, 
time, database access by later-processes, that is, processes ^^^^ ^ ^^.^ 
requesting database access after the reorganization process, 33 ^.^^^^^ ^ ^^^^ ^^^^^^ 53^ 
IS not mipeded by the non-blocking dram. ^^^^-^^ ^^8 MB of main memory running AIX 3.2.5. 

Another advantage of the present invention is that the y^e server computer 104 may incorporate a database 

only time the database is inaccessible to processes other than system, such as DB2 or ORACLE, or it may access data on 

the reorganization process is briefly when the data in the gies stored on a data storage medium such as disk, e.g., a 2 

original database is replaced with the reorganized data, 40 gB SCSI 3.5" drive, or tape. In another embodiment, the 

Furthermore, the invention also provides a number of other server computer 104 may comprise one or more magnetic 

advantages and benefits, which should be apparent from the data storage disks commonly referred to as direct access 

following description of the invention. storage devices (DASD), 

BRIEF DESCRIPTION OF THE DRAWING ^ that, through appropriate data access 

45 programs and utilities 120, the client computer 102 may 

The nature, objects, and advantages of the invention will access one or more databases 122 and/or flat files (i.e., text 

become more apparent to those skilled in the art after files) 124 which contain data. As is well known in the art, a 

considering the following detailed description in connection database may contain one or more partitions and tablespaces 

with the accompanying drawings, in which like reference 128 which are logical divisions of the database, 

numerals designate like parts throughout, wherein: 50 Despite the specific foregoing description, ordinarily 

FIG. 1 is a block diagram of the hardware components skilled artisans (having the benefit of this disclosure) will 

and interconnections of a digital data processing apparatus recognize that the apparatus discussed above may be imple- 

used to reorganize data in a database in accordance with the mented in a machine of different construction, without 

invention; departing from the scope of the invention. Other computers, 

FIG. 2 is an exemplary example of a signal-bearing servers, computer architectures or database systems than 

medium embodying a program of machine-readable instruc- ^^^se discussed may be employed. As a specific example, 

tions executable by the digital data processing apparatus of "^"'^ °f components of output module 112 may 

FIG. 1 in accordance with the invention; eliminated; furthermore, the storage unit 108 may be 

FIG. 3 shows a sequence of method steps generally provided on-board the processing u^^^^^ 

illustrating one embodiment of the method of the present '° ^/^^"y f computer 102 or the apparatus 100. 

invention- Furthermore, the client computer 102 may be combined with 

^' . „ . , - . , ^ the server computer 104 as one unit. 

FIG. 4 IS a flowchart or an operational sequence for 

reorganizing data in a database in accordance with the Operation 

invention; and 65 In addition to the various hardware embodiments 

FIG. 5 shows the timing of the unload step in the dcsaibcd above, a different aspect of the invention concerns 

operation sequence of FIG. 4 with respect to different logical a method for reorganizing a database. 
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Signal-Bearing Media 

Such a method may be implemented, for example, b 
operating the apparatus 100 to execute a sequence of 
machine-readable instructions. These instructions may 
reside in various types of signal -bearing media. In this 
respect, one aspect of the present invention concerns a 
programmed product, comprising signal -bearing media tan 
gibly embodying a pro gram of m achine-readable instruc 
tions es^^uj^y^^^^^^M^^^^igess^ perform 
method^Sid^amzea^atabase 

This signal -bearing media may comprise, for example, 
RAM (not shown) contained within the apparatus 100. 
Alternatively, the instructions may be contained in another 
signal-bearing media, such as a magnetic data storage dis- 
kette 200 (FIG, 2), directly or indirectly accessible by the 15 
apparatus 100. Whether contained in the apparatus 100 or 
elsewhere, the instructions may be stored on a variety of 
machine -readable data storage media, such as DASD stor- 
age (e.g., a conventional "hard drive" or a RAID array), 
magnetic tape, electronic read-only memory (e.g., ROM, 20 
EPROM, OR EEPROM), an optical storage device (e.g., 
CD-ROM, WORM, DVD, digital optical tape), paper 
"punch" cards or other suitable signal-bearing media includ- 
ing transmission media such as digital and analog and 
communication links and wireless. In an illustrative embodi- 25 
ment of the invention, the machine-readable instructions 
may comprise lines of compiled C++ language code, or any 
other computer language or code capable of communicating 
implementation instructions. 
Overall Sequence of 9^^^^^ 

HG. 3 shows a sequence of method steps 300 to illustrate 
a general example of the method aspect of the present 
invention. For ease of explanation, but without any limita- 
tion intended thereby, the example of FIG. 3 is described in 
the context of the apparatxis 100 described above. 35 

A logical database 302 shown in FIG. 3 may comprise one 
or more physical databases and represents hierarchic, struc- 
tured relationships between data segments that can differ 
from the physical structure in which the data segments were 
loaded in the physical database 122, In one embodiment, the 40 
database 302 may be logically divided into one or more 
tablespaces where each tablespacc is comprised of one or 
more partitions 304. A tablespace 306 may comprise a set of 
pages 308 used to store the records of one or more tables in 
a DB2 program; that is, they contain data segments of some 45 
number of fixed-length blocks. A table in the preferred 
embodiment refers to an array of data where each item in the 
array can be unambiguously identified. 

Generally, the logical pages 308 do not need to be stored 
in any particular order in a physical storage unit housing a 
database. However, the logical database 302 must be able to 
find the location of a given page 308 on the physical storage 
unit at any time. As is well known in the art, this is 
accomp lished by constructing an index representing each 
f>i^eof||ggg[aja^ase . Each entry in the gngte- KiBEmta jjT^ a 
pointer to the physical location of a related page in the 
physical database. For example, the first index entry con- 
tains a pointer to the first page in the physical database 122, 
and the scconcyndex^tn^entains a pointer to the second 
page in the physical a ataoas? 122 . Typically, the order of 
pages 308 in the logical database 302 does not correspond to 
the physical order in which the pages appear within the 
physical database 122. ^-^^^^ 

In one embodiment of the present invention, a daj 
rfe^SPM^filfWf^^^uggt^i^ta^ 
as dis cusse d in detail below, places a^^^pcUhe. 
ThejS^t^u^igh'flllgjgi^t^ apparatus 100 that a "requesT fo: 



database access has been made so that the reorganization 
process may proceed. Additionally, changes to the format- 
ting of subsequently generated log records may occur to 
allow compatibihty with the present invention's log record 
processing as discussedbelow. A data set subject to reorga- 
nization ^arggji f^S^B^^ such as a set of pages 308, is 
"unloaded" from the logical database 302. In the preferred 
embodiment, unloading a data set refers to copying a data set 
to the flat files 124 that contains a copy of the target data 
records, and may be used in subsequent steps as a primary 
source of input for recreating the database. Alternatively, the 
data set could be copied to storage unit 108 via data access 
120 if storage unit 108 is provided with sufiBcient capacity. 

During the unload phase, it is determined if any processes 
are using or requesting access to the database. A LRSN, or 
logical record sequence number, is recorded for each such 
process and logically marks or "timestamps" when database 
access was first requested by each process. This is used by 
the present invention to distinguish which processes 
requested database access before the reorganization process 
was begun and are still actively using the database, and 
which processes either began after the reorganization pro- 
cess and are active or have made an access request but are 
queued and waiting to gain access to the database. 

After LRSN's have been assigned, the unloaded jgfec.l 
dSlSISSIK^e organized by the processor 106 and loaded into 
a shadow location 310 of the storage unit 108. As mentioned 
above, various hardware devices well known in the art may 
be used tc HiJgiifeutmitte^me thod of the present invention, and 
its implementation should not be limited to the specific 
example discussed in relation to any one embodiment. In the 
preferred embodiment, the shadow location 310 serves as a 
temporary working location. Log records 312, reflecting 
changes which occurred to the original data set after the 
target data set was unloaded until the present time, are then 
applied to the target data set in the shadow location 310. The 
log records update the target data to the logical equivalent of 
the original data set although in the preferred embodiment, 
the target data set remains in substantially reorganized form. 
In another embodiment, greater degrees of disorganization 
may occur to the target data set and further reorganization 
may be required. In any case, after updating the target data, 
the original data set is then replaced with the target data set. 

FIG. 4 shows a more detailed sequence of method steps 
400 to illustrate one example of the method aspect of the 
present invention. For ease of explanation, but without any 
limitation intended thereby, the example of FIG. 4 is also 
described in the context of the apparatus 100 described 
above. The steps are initiated in task 402, when a data 
reorganization reqg^lSsireeei^TibyltSgiBi^^irgOg^ 
request may be communicated using various mediums stan- 
dard to computer systems, for example, manually by a user 
via a keyboard (not shown), or automatically using com- 
mands stored in' the storage unit 108 and activated at a 
predesignated time." 

Following a reorganization request, a "non-blocking" 
drain in task 404 places a lock on a new target data set 
contained in theldatabase 122. The non-blocking drain does 
not acquire a "traditional" |l2^iESSffiSargg^5taia2Set^ 
wherein a lock is generafly understood to be a serialization 
mechanism by which a data seU^^Lg^je^<?rwise^0nl5!^by 
the holder of the lock. With tn^wblockingTlrain, any 
requests to access the£lOTgg3^51i§^^|S|j not be blocked 
although the sequence in which a lock was requested is still 
recorded. This allows other processes that need to use the 
target data set to access the data set even when the reorga- 
nization process is taking place. 
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Traditionally, a blocking drain has been used to request 
lock on a target data set for data reorganization purposes. 
The process requesting the lock, B, would have to wait for 
the target data set if another process, A, already had a lock^ 
on the target data set. If another process, C, came along am 
requested access to the target data set, it would be placed ii 
a queue behind B waiting for access to the data set. Assumj 
ing the reorganization process B and process C were queuei 
and waiting behind A for the target data set, the unload phas^ 
shown as task 408 would have to wait to use the target datj 
set until all active logical work units (LUW) of process M 
were compl eted, where a L UW inc. 
e t ^eehi 

apparatus 100. "Olrviously, the problem occurs because the 
reorganization process is requesting access to the same table 
space being used by the LUW that is currently active, llie 
resulting waiting period could be of a very long duration if 
the original active LUW's were long running or did not 
complete. 

In the present invention, a delay starting the unload phase 
shown in task 408 of the reorganization process is tolerable, 
but preventing other processes from accessing the target data 
set is not tolerable. By using a non-blocking drain in the 
present invention, the drawbacks of the traditional blocking 
drain lock are overcome because the reorganization process 
of the present invention will wait for a target data set without 
blocking other requests on that data set. The other processes 
pay little notice to the existence of the non-blocking drain 
and are not prevented from gaining access to the target data 
set. One exception is that these other processes may change 
their generation of log records so that the reorganization 
process can successfully process them. For example, the 
format in which the log records are generated may change so 
that the records can be successfully processed by the inven- 
tion. 

After the method has quiesed until all active LUW's for 
processes requesting data access before the reorganization 
process are completed as shown in task 406, unloading of 
the data in task 408 commences and comprises assigning a 
LRSN to each change made to the database. An LRSN is 
associated to each change to optimize integration of the 
changes. For example, a log record may use the LRSN to 
distinguish whether an update to a page of the database has 
already been completed. Using a single data set as an 
example, the unload will pick up data record changes made 
at or after a time t2, as shown in FIG. 5, where t2 represents 
the point where the reorganization unload LRSN is assigned, 
and FIG. 5 shows a timeline representation of the data 
unload step 408. As shown in FIG. 5, the unload step 408 in 
one embodiment of the present invention begins at time t2 
(LSRN=t2) and extends to time t7 representing the comple- 
tion of the reorganization process. 'I'he data unload 408 will 
pick up data records starting at time t2, and then conduct a 
scan of the entire tablespace to determine changes to the 
database. The time t2 is also a marking point for imple- 
menting log records, discussed below. 

Returning to FIG. 4, after thellfcgfPdai^^gllh^^ 55 
unloaded, the data is ordered in logical sequence in task 4107 
reorganized in task 412, and loaded into a shadow location 
in task 414. The target data set may include data ^n^^s^if^ 
which, after the target data set has been reorganized, may be 
rebuilt into reorganized data 1^^^^ task 416. Rebuilding 60 
the data indexes is necessary in the preferred embodiment as 
discussed above so that quick access to the reorganized data 
may occur. Log records are applied to theitM;g eaida;ta |ggigng\^ 
the shadow location in task 418 which allows any changes 
to the original data set which occurred while the reorgani- 65 
zation was taking place to be applied to the reorganized 
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The processing of the log records (logs) for changes made 
to the original database by various LUWs is handled as 
follows. Referring to FIG. 5, the logs will not pick up 
changes made to the database by LUWl 504 and update the 
reorganized data because LUWl 504 has completed before 
reorganization 408 began unloading the target data set 
records. Any changes which occurred due to LUWl 504 will 
be picked up during the reorganization unload 408 of the 
records. 

However, LUW2 506, which started after time t2, is 
treated differently than LUWl 504 by the reorganization 
process 400. All changes to LUW2 506 will be read as part 
f the log apply 418 shown in FIG. 4. If the timing is such 
hat only readj 



ter time t2 and 

before X4 — maHang the^en"d lof XUW2'506, then the logs 
need not include any updated data concerning LUW2 506. 
If the unload of task 408 missed some or all of the data 
changes occasioned by LUW2 506, for example, due to 
format differences, then those changes will be appUed to the 
target data set by the log apply of task 418. 

Furthermore, LUW3 508 started before the reorganization 
unload 408. In this case, in order to insure that all changes 
which occurred to the original data set between the time the 
reorganization unload began and the time the reorganization 
process 400 has completed, the present invention waits tmtil 
time t4 for the reorganization unload 408 to begin. The time 
t4 represents the time when LUW3 508 has completed, 
thereby allowing all changes made by LUW3 508 to the 
original data set to be incorporated in the target data set. 
Essentially, waiting until time t4 for all LUWs active when 
the reorganization began to complete before beginning the 
unload phase insures that all of the changes occurring from 
LUW3 508 are processed properly during the unload step of 
reorganization 400. If the unload phase began before LUW3 
completed, the log format of LUW3 could prevent the 
changes made to the database by LUW3 508 from being 
picked up by the logs. By waiting, the changes will be 
identified by the reorganization unload 408. Because a 
non-blocking drain lock is used by the present invention 
during this waiting period from t2 when the reorganization 
request was made, until the time t4, when all logical units of 
work which were active prior to time t2 have completed, 
potential data unavailability to any process seeking access to 
the original data set between time t2 and t4 is avoided. 
Additional LUWs pay no notice to the existence of the 
non-blocking drain placed on the original data set, and are 
able to immediately access the data unimpeded. The only 
activity which is held in abatement is the beginning of the 
reorganization process 400. This process of determining 
when a LUW of a given process began relative to the unload 
408 is shown in task 410 as determining the order of the 
logical sequence of LUWs. 

The log apply of task 418 is disclo.sed in U.S. patent 
application Ser. No. 08/366,564 (parent application) filed 
Dec. 30, 1994 and entitled "Interaction Between Application 
of a Log and Maintenance of a Table That Maps Record 
Identifiers During Online Reorganization of a Database" and 
is further discussed in U.S. patent application Ser. No. 
08/457,150 (child application), a continuation of the parent 
application, filed Jun. 1, 1995 and entitled "Interaction 
Between Application of a Log and Maintenance of a Table 
lliat Maps Record Identifiers During Online Reorganization 
of a Database" and further discussed in U.S. patent appli- 
cation Ser. No. 08/885,955, a File Wrapper Continuation (37 
C.F.R. 1.62) Application of the child application and entitled 
"Interaction Between Application of a Log and Maintenance 
of a Table That Maps Record Identifiers During Online 
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Reorganization of a Database", filed Jun. 30, 1997. Each of 
the these applications is incorporated by reference herein. 
The log apply task of 418 updates the shadow version data 
to be logically equivalent with the original data set, although 
the shadow version remains in reorganized form. If a scan of 
the target data indicates that all data updates have not been 
included in the shadow version in task 419, then the method 
returns to and repeats task 418. Otherwise, the method 
continues to task 420. 

The preferred method of the present invention may be 
practiced on both compressed and uncompressed data sets as 
shown in task 420. If the target data set does not comprise 
compressed data, the processor 106 replaces the original 
logical data set with the reorganized target data set in task 
422, which is used to reorganize the physical database 122. 
Briefly during this replacement step, neither the original nor 
the reorganized data is accessible by a process request. After 
the reorganization of a database has occurred, the method 
ends in task 426. 

If the target data set comprises compressed data, then a 
new dictionary for the compressed data is built in task 424. 
The use of such a dictionary is well known in the art, and is 
required to "de-compress" the compressed data. Because the 
data was reorganized, using the present invention, a new 
dictionary is required. Similar to the process as apphed to 
uncompressed data, the compressed reorganized target data 
set replaces the compressed original data set in task 422. 
Again, the respective data sets are locked, thereby blocking 
any process request to access the data. The physical database 
122 is updated and the method ends in task 426. 

Other Embodiments 

While there have been shown what are presently consid- 
ered to be preferred embodiments of the invention, it will be 
apparent to those skilled in the art that various changes and 
modifications can be made herein without departing from 
the scope of the invention as defined by the appended 
claims. 

What is claimed is: 

1. A method for use in reorganizing a resource including 
data, the method comprising: 

in response to a reorganization request, using a non- 
blocking drain to lock on the resource; 

delaying processing of the reorganization request until the 
completion of processing of one or more requests on 
the resource made prior to the reorganization request; 

processing one or more requests on the resource made 
during the delay in processing of the reorganization 
request; 

after the delay in processing the reorganization request, 
processing the reorganization request by: 
unloading a copy of the data from the resource; 
reorganizing said copied data; 
loading said reorganized data into a shadow location; 
applying log records to said shadow location data, said 
log records adjusting said reorganized data for 
changes occurring to the resource after said data was 
unloaded; and 
replacing said data in the resource with said reorga- 
nized data. 

2. The method recited in claim 1, further comprising: 
building reorganized data indexes; and 

replacing resource data indexes with said reorganized data 
indexes. 

3. The method recited in claim 2, including determining 
a logical record sequence number, said logical record 
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sequence number used to mark a logical time, wherein 
unloading said data and determining said logical record 
sequence number is queued until all processes utilizing said 
resource which were active when said lock was established 
5 have completed. 

4. The method recited in claim 3, wherein said logical 
record sequence number is determined during the unloading 
of said data from the resource. 

5. 'llie method recited in claim 4, wherein said data is 
10 compressed data and the method includes building a new 

dictionary after said data is reorganized. 

6. A signal-bearing medium tangibly embodying a pro- 
gram of machine-readable instructions executable by a digi- 
tal processing apparatus to perform a method for reorganiz- 

15 ing a resource including data, said method comprising: 
in response to a reorganization request, using a non- 
blocking drain to lock on the resource; 
delaying processing of the reorganization request until the 
completion of processing of one or more requests on 
the resource made prior to the reorganization request; 
processing one or more requests on the resource made 
. during the delay in processing of the reorganization 
request; 

^ after the delay in processing the reorganization request, 
processing the reorganization request by: 
unloading a copy of the data from the resource; 
reorganizing said copied data; 

loading said reorganized data into a shadow location; 
3Q applying log records to said shadow location data, said 
log records adjusting said reorganized data for 
changes occiu-ring to the resource after said data was 
unloaded; 

replacing said data in the resource with said reorga- 
nized data; and 
wherein changes due to the processing of the requests 
made prior to the reorganization request and during the 
delay in processing the reorganization request are 
included in the processing of the reorganization 
40 request. - 

7. The signal-bearing medium for performing the data 
reorganization method as recited in claim 6, further com- 
prising: 

building a reorganized data index; and 
45 replacing a resource data index with said reorganized data 
index. 

8. The signal-bearing medium for performing the data 
reorganization method as recited in claim 7, including 
determining a logical record sequence number, said logical 

50 record sequence number used to mark a logical time, 
wherein unloading said data and determining said logical 
record sequence number is queued until all processes uti- 
lizing said resource and which were active when said lock 
was established have completed. 

55 9. The signal-bearing medium for performing the data 
reorganization method as recited in claim 8, wherein said 
logical record sequence number is determined during the 
unloading of said data from the resource. 

10. The signal-bearing medium for performing the data 
60 reorganization method as recited in claim 9, wherein said 

data is compressed data and the method includes building a 
new dictionary after said data is reorganized. 

11. A digital processing machine used to reorganize a 
database, the device comprising: 

65 a database; 

a digital processing apparatus communicatively coupled 
to said database, the digital processing apparatus con- 
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figured to receive data from said database, and further 
configured lo receive data and commands from a 
signal-bearing medium tangibly embodying a program 
of machine-readable instructions executable by the 
digital processing apparatus and used to reorganize said 5 
database by: 

in response to a reorganization request, using a non- 
blocking drain to lock on the resource; 

delaying processing of the reorganization request until 
the completion of processing of one or more requests lO 
on the resource made prior to the reorganization 
request; 

processing one or more requests on the resource made 
during the delay in processing of the reorganization 
request; 15 

after the delay in processing the reorganization request, 
processing the reorganization request by: 
unloading a copy of the data from the resource; 
reorganizing said copied data; 

loading said reorganized data into a shadow location; 20 
applying log records to said shadow location data, 
said log records adjusting said reorganized data 
for changes occurring to the resource after said 
data was unloaded; 
replacing said data in the resource with said reorga- 25 
nized data; and 
wherein changes due to the processing of the requests 
made prior to the reorganization request and during the 
delay in processing the reorganization request are 
included in the processing of the reorganization 
request. 

12. The digital processing machine recited in claim 11, 
further comprising: 

building a reorganized data index; and 
replacing a resource data index with said reorganized data 
index. 

13. The digital processing machine recited in claim 12, 
including determining a logical record sequence number, 
said logical record sequence number used to mark a logical ^ 
time, wherein unloading said data and determining said 
logical record sequence number is queued until all processes 
utilizing said resource and which were active when said lock 
was established have completed, 

14. The digital processing machine recited in claim 13, 
wherein said logical record sequence number is determined 
during the unloading of said data from the resource. 

15. The digital processing machine recited in claim 14, 
wherein said data is compressed data and the method 
includes building a new dictionary after said data is reor- 
ganized. 



16. An apparatus for reorganizing a database, the appa- 
ratus comprising: 

a digital processing apparatus configured to receive data 
from a database, and further configured to receive data 
and commands from a signal-bearing medium tangibly 
embodying a program of machine-readable instructions 
executable by the digital processing apparatus and used 
to reorganize said database; 

means for storing said database; 

non-blocking drain means for accessing and locking said 
database in response to a reorganization request; 

means for delaying processing of the reorganization 
request until the completion of processing of one or 
more requests on the resource made prior to the reor- 
ganization request; 

means for processing one or more requests on the 
resource made during the delay in processing of the 
reorganization request; 

means for processing the reorganization request after the 
delay in processing, including: 
copying means for copying data from said database; 
means for reorganizing said copied data; 
means for loading said reorganized data into a shadow 
location; 

log means for applying log records to said shadow 
location data, said log records adjusting said reorga- 
nized data for changes occurring to said database 
after said data was unloaded; and 

means for replacing said data in said database with said 
reorganized data. 

17. The apparatus recited in claim 16, including means for 
determining a logical record sequence number, said logical 
record sequence number used to mark a logical time within 
said copied data, and wherein means for copying said data 
and determining said logical record sequence number is 
queued until all processes utilizing said database and which 
were active when said lock was established have completed. 

18. The apparatus recited in claim 17, wherein said logical 
record sequence number is determined during unloading of 
said data from said database. 

19. The apparatus recited in claim 18, further comprising: 
means for building reorganized data indexes; and 
means for replacing database data indexes with said 

reorganized data indexes. 

20. The apparatus recited in claim 19, wherein said data 
is compressed data and the apparatus further comprises a 
means for building a new dictionary for said compressed 
data after said data is reorganized. 
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